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We consider the problem of minimizing a polynomial function over the 
integer lattice. Though impossible in general, we use a known sufficient con¬ 
dition for the existence of continuous minimizers to guarantee the existence 
of integer minimizers as well. In case this condition holds, we use sos pro¬ 
gramming to compute the radius of a p-norm ball which contains all integer 
minimizers. We prove that this radius is smaller than the radius known from 
the literature. Furthermore, we derive a new class of underestimators of the 
polynomial function. Using a Stellensatz from real algebraic geometry and 
again sos programming, we optimize over this class to get a strong lower 
bound on the integer minimum. 

Our radius and lower bounds are evaluated experimentally. They show a 
good performance, in particular within a branch and bound framework. 

Keywords integer optimization; polynomials; lower bounds; branch and bound 

1. Introduction 

Given a multivariate polynomial / : M” —)• M, we consider its minimization over the 
integer lattice, i.e., the problem 

min f{x) 
s.t. X G Z”. 

This is a special type of a nonlinear integer optimization problem and is incomputable in 
general; Hilbert’s tenth problem asks if there exists an algorithm that decides whether 
for a given polynomial / with integer coefficients the equation f(x) = 0 has a solution 
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X G Z”. Seventy years later it was proved by Matiyasevich |Mat70| that no such 
algorithm can exist. So if there was an algorithm to solve |IP[ we would also get an 
algorithm to decide whether f{x) = 0 has an integer solution by minimizing over Z”. 
Consequently, IP cannot be solved for general polynomials /. In this paper we consider 
a subclass that leads to solvable problems. 


1.1. Outline 


Once the notation, a little background on sos (sum of squares) programming and a 
Stellensatz from algebraic geometry are introduced (Section]^, we review, in order to 
make the problem tractable, a sufficient criterion from the literature for the existence of 
continuous minimizers (Section]^. This criterion actually holds for integer minimzers, 
too: Integer minimizers exist if the highest order terms of / attain positive values on 
M"’ \ {0}, we say that the leading form of / is positive definite. But deciding positive 
definiteness is NP hard, hence we approximate this problem by sos programming. How¬ 
ever, this only tells us that minimizers exists, but not where they are located. We locate 
the minimizers by computing - again using sos programming - the radius of a p-norm 
ball that contains all integer minimizers (Section]^, or simply norm bounds on the min¬ 
imizers. In principle, once a norm bound is known, |I^ is solvable by enumeration. We 
proceed by deriving a class of polynomials with obvious integer minimizer (Section 
that serve as underestimators to /. Using sos programming, we may choose the underes¬ 
timator g with the strongest lower bound. Firstly, we search for a global underestimator 
which is later refined to underestimation on sublevel sets, yielding stronger bounds. This 
refinement further allows to prove that, provided / has a positive definite leading form, 
there always are underestimators in our class that can be found by sos programming. 


To find the optimal solution to IP instead of enumeration, we use an underestimator g 
from our class to obtain lower bounds within a branch and bound approach (Section]^. 
We continue with an experimental evaluation of the norm bounds, of the lower bounds 
and of the performance of the underestimators within branch and bound on random 
instances. The paper ends with a conclusion and ideas for future research (Section]^. 


1.2. Literature review 

The literature on nonlinear integer programming is vast. For an overview, a presentation 
of key techniques and complexity results as well as numerous references for further read¬ 
ing, see the article |HKLWin| , which comes as chapter of j.ILN'*~in] . For a recent survey 
on nonlinear mixed-integer programming (a subset of the variables may be continuous), 
see |LL12] . 

Throughout our work we rely heavily on methods from constrained continuous polyno¬ 
mial optimization. Based on work of Shor |Sho87[ ISS97j . Parrilo |Par00j suggested a 
method now known as sos programming that makes continuous polynomial optimiza¬ 
tion accessible to semidefinite programming (see, e.g., |WSV00| for the latter), whilst 
Lasserre [Las nu published the dual approach, based on moment sequences. Since the 
emergence of the two ground-breaking publications by Parrilo and Lasserre, many results 
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on continuous polynomial optimization via sos techniques and its theoretical background 
have been published: The expository paper |PSn3] shows that existing algebraic tech¬ 
niques are outperformed by the sos method. As in-depth treatments, we refer to |AL12] 
for the interplay of semidefinite, conic and polynomial optimization, and |BPT13] for 
a focus on the geometry involved. For an algebraic treatment, we mention Marshall’s 
book |Mar08j . We point out Laurent’s elegant survey |Lau09j . which treats, among other 
aspects, the duality of the sos and moment approach. 

A special case of our problem, unconstrained quadratic integer minimization, is con¬ 
sidered by |BHS14| . We did not find results in the literature that consider the uncon¬ 
strained integer minimization problem for multivariate polynomials of arbitrary degree. 
Regarding nonlinear integer minimization with constraints or additional assumptions, 
integrality turns even seemingly simple problems incomputable: Using the aforemen¬ 
tioned result of Matiyasevich, Jeroslow |Jer73j proved that there cannot be an algorithm 
for integer minimization of a linear form subject to quadratic constraints. 

But substantial special cases are solvable, for example, every integer problem with a 
bounded feasibile set is solvable. More specifically, an important case is boolean pro¬ 
gramming, see [BH02] for a survey. A classic approach is linearization by introducing 
new variables and constraints (for early results see, e.g., |For6n] ). In theory, also a gen¬ 
eral bounded integer polynomial optimization problem can be reduced to the binary case 
|Wat67] , but this is not practicable since the number of variables grows too quickly. An¬ 
other technique for boolean programming is the reduction to a quadratic problem which 
can be done with significantly fewer variables and constraints |Ros75l IBR07j . Another 
substantial case that gained attention are (quasi-)convex problems, as the incomputabil¬ 
ity results do not hold for this case |Kha831 [KPOH] . |HK13j present a Lenstra type 
algorithm for quasiconvex integer polynomial optimization. 

For integer minimization of arbitrary polynomials, a common way of solving |I^ is branch 
and bound as proposed (originally only for convex functions) by |GR 85j . A popular 
method is to calculate convex underestimators (see, e.g., |LTll] i to obtain lower bounds. 
As a different approach, if the feasible set is a box, |BD14j compute separable underesti¬ 
mators wich give lower bounds that are easy to obtain. In contrast, |LHKW()6] directly 
compute lower and upper bounds, i.e., no underestimators, for nonnegative polynomials 
on polytopes. 

2. Preliminaries 

In this section we collect some basic notation and facts as well as a few theorems from 
algebraic geometry that we use to derive our main results. 
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Notation and basic properties of polynomials 

We write a polynomial / ; —)• M in n unknowns Xi,..., using multi-indices 

a = («!,..., an) G Ng via 

f=j2 E 

oeNff oeNj 

for some unique coefficients Oq, G M, only finitely many nonzero, and monomials := 
• • • X"". The modulus of a G Ng is \a\ = ai + an- With these conventions, 
the degree of / is given by 

deg / := max||a| | Oq / 0} . 

The ring of polynomials in the unknowns Xi,..., Xn is denoted by M[Xi,..., X^], which 
we abbreviate to M[X]. We use X here in order to distinguish the multivariate from the 
univariate case. 

A polynomial / is homogeneous if all monomials in / have the same degree, say d. That 
is, / is homogeneous if / = Yl,\a\=d^o‘X°‘. In this case one has 

/(Ax) = A'^/(x), X G M*", A G M. 

This implies that a homogeneous polynomial is uniquely determined by its values on any 
of the p-norm unit spheres 

:= {x G M” I ||x||p = 1 } , p G [1, 00 ]. 

A homogeneous polynomial / is positive definite if /(x) > 0 for x / 0. Similarly, 
a (possibly nonhomogeneous) polynomial / is positive semidefinite if /(x) >0 for all 
X G M”, for short / > 0 and / > 0. For a homogeneous polynomial / and some p G [1, 00 ], 
we often use the following equivalent characterization; 

/ > 0 3c > 0 : /(x) > c for all x G 

/ > 0 <^=i> 3c > 0 : /(x) > c for all x G ^ ^ 

A homogeneous polynomial is also called a form. Any polynomial / G M[X] can be 
uniquely decomposed as 

d 

j=0 

where d := deg / and the fj are homogeneous polynomials of degree j, called the homo¬ 
geneous components of /. The highest degree component, fd, is called the leading form 
of/. 

Given a vector h G M”", we denote by [/i] the vector resulting of rounding each component 
of h to its nearest integer. Finally, we need the notion of a sublevel set: For a function 
/:[/—;■ M from some set U, the sublevel set of level z G M is defined by 

Cl^{z) = {x G [/ I /(x) < z}. 
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Nonnegativity and sums of squares 

Continuous minimization of a polynomial as well as deciding non-negativity of a poly¬ 
nomial are well known to be NP-hard problems, even if one fixes the degree to d = 4 
|NesOO| . Deciding if g is an underestimator of / means to decide if / — g is nonnegative. 
As this is NP-hard we use a tractable sufficient criterion for nonnegativity in the follow¬ 
ing: We search for a decomposition into a sum of squares, or sos for short, which is a 
sufficient, but not necessary condition for nonnegativity |Mar08j . Formally, a polynomial 
/ G M[A] is a sum of squares if there are ... ,ui G M[A] such that / = We 

sometimes use the following property: 

Lemma 1. Suppose v = uf + ■ ■ ■ + u1 for some given ui,. .. ,Uk G M[X] and ui / 0. 
Then v ^ 0, and 

deg V = 2 max deg Ui. 
l<i<k 

Proof. See, e.g., [MarOSl Cor. 1.1.3]. □ 

The convex cone 

S := |/GM[X] I 3ui,...,ui gM[A] s.t. / = 

in M[A] contains all polynomials / which are sos in M[A]. It is possible to optimize a 
linear form such that affine combinations of the decision variables and given polynomials 
lie in this cone: Such an sos optimization problem or sos program is tractable, as it 
is equivalent to a semidefinite program. For details, see e.g. [ATTl^ IBPT1 3j on SOS 
programming and |WSV00] for semidefinite programming. Formally, an sos program 
has the form 


max 6iyi H-h 

bmiJm 

S.t. £lj0 T T ■ ■ ■ T ym.O'im £ i — 1, . . . , /c, (2) 

yi G M, i = 1,... ,m, 

where y* G M are the decision variables, and G M as well as Oij G M[Xi,..., are 
fixed. In our paper we use sos programming for two purposes: To find an optimal un¬ 
derestimator, see Section and for constrained continuous minimization of polynomials 
as done at the end of Section For both, we use a result from real algebraic geometry, 
known as Putinar’s Stellensatz, outlined next. 

A result from algebraic geometry 

In this section we introduce Putinar’s Stellensatz. See |NS07j and the references therein 
for a discussion and the origins of the Stellensatz. For a finite collection of multivariate 
polynomials S = {yi,..., y^} C K[A], define the semi-algebraic set Ks as 

Ks := {x G I yi(a:) > 0,..., gs{x) > 0} , (3) 
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where our notation follows |Mar08| . The Stellensatz we consider gives a sufficient condi¬ 
tions which allows to construct every polynomial / G M[X] with / > 0 on Ks from the 
given inequalities gi{x) > 0. To this end, for S as above, the quadratic module generated 
by S is given by 

:= (To,... ,0-s G s| (4) 

where qq := 1. For the Positivstellensatz to hold we need Ms to be Archimedean. This 
is the case if there is a polynomial q G Ms such that the set = {x G M” | q{x) > 0} 
is compact. 

Theorem 2 (Putinar). Let Ms be Archimedean and f G M[X]. Then f{x) > 0 for all 
X G Ks implies f G Ms- 


Lower bounds for constrained continuous minimization 


In Section we will see that in order to decide existence of minimizers, we need to 
compute a lower bound on the minimum of the leading form on the sphere As 

the sphere is semi-algebraic for even p, sos methods can be applied to find such a lower 
bound. 


(5) 


In the following we describe how lower bounds on 

min f{x) 
s.t. X G Ks 

where Ks defined as in Q can be derived by sos-programming. The method we outline 
follows Schweighofer [SchOhj , based on Lasserre’s |Lasni) work. We consider a hierarchy 

Qfc 


A; = I, 2,..., of SOS programs 
max yi 


s-t. / - yi - G S 

i=l 

deg{aigi) < k, 

(Ti G S, 

yi G K. 


i = 1, 
i = 1, 


,•5 

,•5 


(Qfc) 


In Qfc, the decision variables are yi G M and the real coefficients of fii,..., CTs G R[A]. 
We then have that every feasible solution yi to Qfc gives a lower bound on (§ , i.e., on 
min{/(x) I X G Ks}- Indeed, if yi is feasible, there are (Tq, ... ,cJs G S, deg{aigi) < k for 
i = I,..., s, such that 

S 

f = yi+ao + '^ Cfigi 

i=\ 


f{x) =yi+ cro(x) -F ai{x)gi{x) >yi, x € Ks, 


i=l 
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as (Tj G S, hence dj are nonnegative, and gi{x) > 0 on Ks by definition of Kg- Hence / is 


bounded from below by yi on Ks, i.e., every feasible solution to is a lower bound on 
([^. A justification for the ansatz is the following well-known and easy consequence 
of Putinar’s Positivstellensatz (Theorem]^: 

Corollary 3. L et Ms he Archimedean. Denote the minimum of S by f* and the 
minimum of Qk by y[^^. Then y[^^ f* for k 


oo. 


Although finite convergence is not guaranteed [Las nn, there are cases where an optimal 
solution X G Ks to ([^ can be extracted from , see e.g. |HLn5j . In the uncon¬ 
strained case min{/(x) | x G M”} given by s = 0 (i.e. Ks = M”) in ([^ even more 

is known: Instead of solving |Qfc| with respect to Ks = M"' which would be given as 
max{7/i \ f — yi G S}, one can consider the gradient variet}Q resulting in 2n con¬ 
straints corresponding to the equations dx^ / = ... = dx„ / = 0 and solve with 

respect to 

S' = {dxj ,.. ■,dx^f, -dxj,-dx^f}. (6) 

Then we have: 


Theorem 4 (jNDSOGj). Consider the set of polynomials of degree at most d G No that 
possess a global continuous minimizer: 


'■= {/ £ I deg(/) < d and G M” s.t. f{x*) = f* = inf f{x)}. 

Then, for the sos-programs with gradient variety constraints S' from finite con¬ 
vergence holds for almost all polynomials f G Td- More precisely, there is a ko G No 
s.t. for the optimal solutions of one has y[^'^ = y[^°^ = f* for k > k^. Moreover, 
a minimizer x* of (§ can then be extracted. 


3. Existence of minimizers: sufficient and necessary conditions 

Before we search for integer minimizers of a polynomial / G M[A], we review sufficient 
and necessary conditions to decide whether integer or continuous minimizers exist at 
all. For nonconstant univariate polynomials, this is equivalent to an even degree and 
a positive leading coefficient, which is in turn closely related to the behavior of f{x) 
as |x| —>• oo. For multivariate /, the situation is similar once we decompose / into its 
homogeneous components (see Section]^ for the definition). A positive definite leading 
form is a sufficient condition for the existence of continuous minimizers whilst positive 
semidefiniteness is a necessary condition [MarOBl IMarOQ) . In our next result we show 
that this holds for integer minimizers as well. Together with some observations that will 
be of use later on, these results are reorganized in the following proposition. 


^In case of unconstrained continuous minimization, provided minimizers exist, restricting minimization 
of / to the subset of R" where the gradient vanishes does not change the set of optimal solutions. 
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Proposition 5. Let f G M[X] with deg / = d > 0. The following implications hold: 

/d > 0 => all C{^{z) compacts f has i.m. inf f{x) > —oo fd > 0 ^ d even 

— xSZ" 

t t 

liminf f(x) = +cx) ^ f has c.m. ^ inf f(x) > — oo 

|a;|^+oo xeM" 

where i.m. abbreviates integer and c.m. continuous minimizers. In addition, none of 
the implications above can be strengthened. 

Proof. Let fd > 0, and c* := min^ggn-i /j(x), j = 0,...,d, where c*j > — oo for 
j = 0,..., d — 1 by compactness of the sphere and > 0 since fd is positive defi¬ 
nite. For 0 7 ^ X G M”’ and p G [1, oo] this means 

fix) = ^ 

j=0 3=0 3=0 

The expression on the right can be considered as a univariate polynomial in ||x||p with 
positive leading coefficient, so liminf| 3 ,|^_|_oo f{x) = -|-oo follows. If the latter holds, the 
sublevel sets must be bounded for all z G M. As / is continuous, the sublevel sets 

are moreover closed, and compactness follows. In case the limit inferior is s G [—oo, -|-oo), 
pick z G (s,-|-oo). There must be a sequence Xk G M", ||xfc||p —)• oo as /c —>• oo, such 
that f{xk) < 2 for all k. Put differently, x^ G T{^iz) for all k, hence the level set 
is unbounded, and the only equivalence in the diagram is proven. We show the two 
rightmost implications in the first row next, the remaining ones are straightforward. So 
suppose there is x G M" such that /d(x) < 0. By homogeneity, we may assume x G 
By continuity, there is a whole neighborhood IT of x s.t. fd{y) < 0 for all y G IT. As 
IT n 7 ^ 0, there is a point r G IT n with rational coordinates r* = ^, Zi G Z, 
rij G N, i = 1,..., n. Now for all A G M, 


d 

/(Ar) = ^/j(r)A^ 

3=0 

and since /d(r) < 0, we have /(Ar) —>• —oo as A —)■ oo. Since r* = ^, i = 1,..., n, 
there is a lowest common denominator 1 G N of the r*. For A: G N, we have especially 
fiklr) —>■ —oo as /c —)• oo. But since klr G Z”, / is unbounded from below on 
For the last implication, let d be odd. As fd is a nonzero polynomial, there is x G M"' 
s.t. fd{x) 7 ^ 0. Homogeneity of (odd) order d implies either fdix) < 0 or fdi—x) < 0, 
therefore fd is not positive semidehnite. 

Finally, a collection of counterexamples proving that none of the implications of the 
proposition can be strengthened can be found in |Behl3) . □ 
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In the following, we rely on the sufficient condition /^ > 0 to ensure the existence of 
integer minimizers. As deciding nonnegativity of fd is NP hard, we compute a lower 
bound Cd on the leading form fd restricted to the sphere, i.e.. 


^ * 

Cd<Cd = 


mm 


fd{x) 


(7) 


If > 0 we know from Q that fd > 0, so integer minimizers exist by Proposition 
Our approach fails if < 0 unless we find a point x G s.t. fd{x) < 0 which certifies 
that fd is not positive semidefinite, and hence / cannot have minimizers. 

4. Norm bounds on the minimizers 

4.1. A new bound on the norm of integer minimizers 

If / = satisfies = min^ggn-i fd{x) > 0, i.e., fd is positive definite and a 

lower bound Cd on the minimum with 0 < is known, it is possible to give a 

bound i? > 0 on the norm of the continuous minimizers. We only found one bound in 
the literature, which assumes p = 2, 


^ d-l 

Rut := max [ 1, — 

^d . 
J=1 


tj\\i = max 



( 8 ) 


0<|a|<d 


from Marshall where ||/||i := l®ol f®^ / = h is a special case (empty 

constraint set) of a more general result |Mar03j . Laurent |Lau09j gives a more elementary 
proof for Marshall’s bound Q by showing /(x) > /(O) for ||x ||2 > Hence Rm gives 
a valid bound on integer minimizers as well. However, for non-sparse polynomials, 
this bound may get quite large. Within branch and bound approaches it is crucial 
to find a small bound R to reduce the number of feasible solutions - scaling R hy a 
constant C > 0, the number of integer points that satisfy the norm bound scales with 
a factor of (roughly) We hence suggest a different approach: In the following 

theorem, we still compute i? > 0 with f{x) > /(O) for ||x||p > R, but instead of 
bounding all homogeneous components simultaneously, we compute constants Cj such 
that Cj < c’j = min^ggn-i fj{x) on a suitable sphere 

Theorem 6. Let f G M[A)] with deg / = d > 0. For a fixed p G [l,oo], let Cj G M s.t. 
fj{x) > Cj for all X G j = l,...,n. Suppose Cd > 0. Let R denote the largest 

nonnegative real root of the univariate polynomial q : M —>■ M, 


i=i 

1. Then, integer as well as continuous minimizers x' of f (do exist and) satisfy 

||a^^||p < R- 
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Let X* he any of the integer minimizers. 

2. We have\xl\ < yR\, for i = 1,... ,n. 

Proof. We prove , the other assertion follows directly from integrality of x*. By com¬ 
pactness of the sphere, every fj is bounded below by some Cj G M. We observed in Q 
that Cd > 0 implies positive definiteness of fd, hence integer and continuous minimizers 
exist and d is even (Proposition]^. Using homogeneity, 

f{x) - /(o) = '^fj{x) = Yjfj = ^(ll^llp) (9) 

i=i j=i j=i 

for X / 0. Since q is univariate and of degree d > 0, it has at most d real roots. As 
g(0) = 0, q has roots in [0, oo), and we denote the largest of them by R. As before, 
Cd > 0 yields lim^^+oo qW = +oo. This together with the intermediate value theorem 
implies g(A) > 0 for A > i?. Thus, eq. Q forces f{x) > /(O) for ||x||p > R. □ 

Remark 7. The larger the cj the smaller the resulting norm bound R. Formally, let 
q = CjA-^ , q = CjA-^ , such that Cj > Cj, and call the largest nonnegative roots 

R and R, respectively. Wlog, it suffices to consider the case that cj = Cj for j k and 
Cfc > Ck for some k G {1,..., n}. Now q{X) — q{X) = {ck — Ck)X^ > 0 for A > 0 and by 
assumption on Ck, Cfe. Thus, q{X) > q{X) for A > 0, hence R < R - unless R = 0. In this 
case R = R = 0. 


Before we present different methods of computing valid Cj, we compare R and Riit. In 
the experiments in Section we show that our norm bound R is drastically smaller 
than Also, it can be proven that our norm bounds are never larger and, except for 
special cases, are actually strictly smaller than the bound from the literature. 

Proposition 8 . Let f with deg/ = d > 0 and Cd > 0 such that fd{x) > Cd for all 
X G j = 1) • • ■) Compute R G [0, oo) as in Theorem^for 

Cj :=-||/il|i, J = 1, (10) 

and compute Rut G [l,oo) as in Then R < Rut- If moreover d > 2 and there is a 
coefficient Oq 7 ^ 0 of f with |a| < d — 1, then R < Rm for Rf^l and R = Rut for R = 1. 


Proof. At first we observe that the numbers Cj = — 
bounds, for general p G [l,oo]: As ||x||p < 1 implies 
has 


II/: 


Jill in ( 10 ) are indeed valid lower 
a^lloo < 1 and hence |x“| < 1 , one 




^ Oax" > ^ -|aa||x“| > ^ -|a„ 
l«l=i l«l=i H=j 


WfjWi — Cj, 


X G § 


n—1 

P 


We prove the case d > 2 and Oq, 7 ^ 0 for some a with |q;| < d — 1. The claim obviously 
holds in case R < 1. For the cases R = 1 and R > 1 , define q{X) = Cj^'^ as before 

and let q{X) = CdX'^ + (^X]j=i Cj^ A'^“^. Then we have 

q{X) > q{X) for A > 1, q{X) = q{X) for A = 1 (11) 
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as Cj < 0 for j = 1 ,..., d — 1 and one < 0 for some k G { 1 ,... ,d — 2} by the 
assumption on Oq. By definition, the largest nonnegative real root of q is R, and the 
largest nonnegative real root i? of g is 


- d—1 


Cd 


i=i 


1 

Cd 


E 

0<|Q:|<ci 


and, by definition, = max(l,i?). If i? = 1, we infer from (11) that 0 = q(l) = q{l), 
so i?iit = 1. In case i? > 1, we infer from © that 0 = q{R) > q{R), so R < Rm as 
q{X) —>■ +00 for A —)> +oo. The proof for the two remaining cases, d = 2 or all Oq = 0 
for |a| < d — 1, is similar as g = g in these cases. □ 


We now present different ways of computing bounds Cj on c* = min^ggn-i fj{x). 

1. We saw in the proof of Proposition that Cj = — ||/; ||i gives valid lower bounds 
for any p G [l,oo]. However, this bound is rather rough and only useful for the 
lower order forms, that is those fj with j < d. 


2. The arguably easiest way to find such Cj by sos programming is to minimize fj 
on the sphere More specifically, for p G 2N, the constraint ||x||p = 1 is 

equivalent to the constraint ~ which is semi-algebraic for even p > 0. 

The hierarchy Qk with gi = 1 — XlILi 52 = ~ ^ can be rewritten 

as 

max yi 


s.t. fj-yi-q - ^ ^ 

q G M[X], degq < k 
yi G M 


( 12 ) 


where we used + 0292 = (o'! —(y 2 )gi = qgi, some q G M[X], as any polynomial 
can be written as the difference of sums of squares, e.g. using Aq = {q+lf^ — {q—lY. 


3. A different lower bound on the leading form can be computed via the program 

n 

max 7 s.t. fd - 1 ■'^Xf (13) 

from |Niel2| choosing p = d. 

4. We present two refined approaches of item in the Appendix: As a first step, we 
replace the underlying estimate ||x"|| < 1 by ||x"|| < ||x“||, where x is a maximizer 
of x" on the sphere. In a second step, considering all orthants separately allows 
then to furthermore get rid of approximately half of the terms. 


Remark 9. If p G 2N, the set Ms with S = {1 —^ is Archimedean. 

Hence, from Corollary]^ the optimal objective values of ( [I 2 I ) converge, for k —)• 00 , to 
c* = min^ggn-i fj{x) - which are, by Remark ItI the best possible bounds Cj. 
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4.2. Application to systems of polynomial equations 

In this section, we consider an application to systems of polynomial equations; we test 
our approach on random instances of polynomials in the next section. It is a common 
approach to solve a system of equations gi{x) = 0, i = 1,..., s, with solutions restricted 
to, say, X G Q” or W^, by minimizing f = Qi + ■ ■ - g^ over the integers, rationals or 
reals, respectively. If the minimum is 0 at some x, the equations have a solution at x; if 
the minimum is nonzero, there cannot be any solution. 


4.2.1. Diophantine equations 


As an example, does the system 

—3xf + xfx 2 — xf + 2xiX 2 + xi — 2x2 “ 2x2 + 4 = 0 

2x2 + X 1 X 2 + 4x2 — 5 = 0 


possess an integer solution? Denote the polynomials in Z[Aii, A 2 ] on the left hand side 
in the first and second equation by gi and 52 , respectively, and consider / := + 52 - 

The homogeneous components of / are bounded from below on Sg by 


(ci,..., ce) = (-60.49, -13.03, -41.76, -7.85, -24.45, 2.59), 


we found the values by solving (12) numerically. The univariate polynomial q{X) = 

integer 


Ylj=i roots: 0 and R ^ 9.90. Thus, by Theorem 

minimizers exist and must be in the box [—9,9]^. Iterating over all integer points in the 
box one finds /(xi, X 2 ) = 0 at (xi, X 2 ) = (—1,1). From the perspective of number theory, 
our method provides search bounds on solutions of a system of Diophantine equations if 
the leading form of / = 9 j i® positive definite. 


4.2.2. Bounds on algebraic varieties 


Similarly to the systems of Diophantine equations, our bounds apply to real algebraic va¬ 
rieties: Given gi,... ,gs G M[A], the variety of the gi is V{gi,, gs) = {x G M” | 51 (x) = 
0,... ,gs{x) = 0}. If the leading form of / = 9j i® positive definite, we may give 

a norm bound on all points of the variety. As an example, let us consider the system 
from [CLOn71 Example 2, Sec. 2 § 8]: 

x^ + = 1 (14) 

2 , 2 

X + z =y 


X = z 
x,y,z G C 


Computing the Cj by solving (12) for p = 2 yields (ci,..., C4) = (0, —2.0, —0.77,1.0) and 
gives us 7? ~ 1.86 as a 2- norm bound on all points in the variety. It is known that the 
variety consists of exactly four points: The system has two real and two complex solutions 
{x,y,Zi) with Zi G +\/5 — 1}, where the real solutions suffice ||(x, y, 2;)||2 = 1 by 

0- We conclude that in this case our bound is not far off. 


12 









5. A class of underestimators 


5.1. Global underestimation 

Now let /, g : M” —)> M. We then have 

(Vx G M"' : g{x) < f{x)) => inf g{x) < inf f{x) (15) 

where inf^^gz" dix) gives a stronger bound on the integer minimum of / than infa;g]Rn g{x). 
Using the integer minimum of g to derive a lower bound on the integer minimum of / 
makes only sense if integer minimization of g is easy compared to integer minimization 
of /. We motivate our class of easy-to-minimize underestimators g with an observation 
on monomials with a shift in the argument which shall serve as the building blocks to 
the more general underestimators. 

Observation 10. For some h G M” and a G Nq, let 

n 

g = {x-hr = Ylix, - 

i=i 

he a shifted monomial. If all a* are even, g has a continuous minimizer at h and an 
integer minimizer at [/i]. If one ai is odd, g is not bounded from below and does not 
have continuous or integer minimizers. 

Our underestimators are conic combination of shifted monomials with even aj, j = 
1,... ,n, as the combinations inherit the integer minimizer [h]. More precisely: 

Proposition 11. Let a polynomial g G M[X] he given as g = Yla^rt{X — h)‘^°‘ with 
ba^ 0 for a / 0, and h G 

1. The restriction of g to x^xnr=fc+i{ Xi} that is, the univariate function 

y I—>■ ^(xi,..., Xk-i, y, Xfc+i,..., Xn) for fixed x G M” is nonincreasing for y < h^ 
and nondecreasing for y > hk, A: G {1,..., n}. 

2. We have g{xi,. ..,Xn) > g(xi,... ,Xk-i, [hk],Xk+i, ■■■ ,Xn) for every x G 

3. h is a continuous and [h] an integer minimizer of g. 

Proof. The claimed properties hold for every term {X — Thus they hold for conic 
combinations of such terms. □ 


Three properties make these polynomials g useful underestimators: integer minimization 
is trivial, and all nonlinearity is confined to the parameter h. Also, the fact that the 
expression is linear in the ba makes them accessible to optimization. Proposition 11 
motivates 


13 



Notation 12. We denote the set of conic combinations of monomials with a shift of h 
by 


^{h) := {ge M[X] 


g = ^ ha{X — ha G K>o for all a / 0, J C Nq finite > . 


aeJ 


As an example, the polynomial 

g={Xi- 1.5)^(A2 - 2f + 0.3(Ai - 1.5)2(A3 - 3.2)® - 1 E ^(1.5, 2, 3.2) 


with J = {(4, 6 , 0), (2, 0, 8 ), (0, 0, 0)} has an integer minimizer at (1, 2,3). 
Proposition 13. Let g E ^(/i) satisfy g{x) < f{x) for all x E M”. Then 


g{[h~\) < inf f{x) 


Proof. This follows from (15) and Proposition 11 


□ 


For determining an underestimator g we still have to choose h and the coefficients 
This is described next. 


Choice of h: In principle, every h E M” may be chosen. Heuristically, we chose an 
(approximate) continuous minimizer of / since g has its continuous minimizer at h. In 
fact, every nontrivial g looks like an elliptic paraboloid or a parabolic cylinder near h, 
as does / near every local minimum. For almost all /, the continuous minimizer of / 
can be found using sos methods (Theorem]^. 

Choice of ba- We choose the ba so that the lower bound ^([/i]) is maximized. In other 
words, we wish to maximize the expression 

aeJ 

subject to g < f. The higher order terms in g ensure a certain aggressiveness in the 
growth behavior away from h, even for small coefficients ba, which leads to strong bounds. 
Using the notation Wa ■= (L^l ~ we get the following optimization problem: 

max > Waba 
J, 6a 

a&J 

s.t. /(x) — ^ 60 ( 3 ; — h)^" > 0 Vx E M"' 

a&J 

ba>0 for a 7 ^ 0 

with decision variables E M, a E J and J C Nq finite. Since this program is not 
tractable in general, we consider the following sos version instead: 
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y = max ^ w^ba 


(GLOB) 

aeJ 

s.t. /-V6 «(X-/i)2“ 

is SOS in M[Xi,. 


aej 

ba 

is SOS in M[Xi,. 

., Xn] for a 7 ^ 0. 


The decision variables are the real ba, a ^ J. Note that 6 ^ G S is equivalent to ba > 0. 
Once J is fixed, GLOB| is a valid sos program. We show in Corollary that it is 
sufficient to choose J = {a G Ng | |q:| < deg(/)/2}. 


In the following we identify a solution a G J, with the polynomial g it defines, that 
is with g = ~ hence may say that a polynomial is a feasible or 

optimal solution to GLOB We note that every feasible solution to GLOB| (for any choice 


of h) gives valid lower bounds on IP 


Theorem 14. Let f G M[X], h G M” and g 
solution to [GLOB for some J. Then 


— h)"^^ G 'T^{h) he a feasible 


1 - 9{[h]) < infa,6Z" f{x). 


If moreover f — f{h) G S holds and g is an optimal solution to GLOB, then 


2 . g{[h'\) > f{h). 


Proof. Claim holds as g being feasible to GLOB| implies f — g £ Ti, hence f — g > 0, 
and the claim follows by Proposition |13[ Concerning Claimj^ observe that f — f{h) G B 
implies that h is a continuous minimizer of / and that the constant polynomial g = f{h) 
is a feasible solution to|GLOB[ hence g{\h~\) > g{\h~\) = f{h) for every optimal solution 
g G ^{h). □ 


5.2. Improving the underestimators 
Motivation 


A quite restrictive condition in GLOB is that it requires g{x) < f{x) globally, i.e., for 
all X G M”". Actually, this is not necessary for our purposes. It is enough to require 
g{x) < f{x) only for those x G M”" that satisfy /(x) < f{q) for some q G Z"". That is, for 
all q £ TP, we have 


(Vx G C{if{q)) : gix) < fix)) inf g{x) < inf /(x). 


(16) 


in other words, the integer minimum of 5 is a lower bound on the integer minimum of 
/ even if g is an underestimator of / only on a sublevel set C^if{q)). If we make use of 
this in our sos program, the lower bound can only improve. 
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But before we delve into the details, let us consider the potential payoff by taking a look 
at the example in Figure [Ta] The plot depicts the univariate polynomial 

/ = 0.2 • (X - 0.3)® - 5 • (X - 0.3)^ + 32 • (X - 0.3)^. 

along with two underestimators 5 glob, 9sls- A short calculation shows that / has five 
local extrema at 0.3 and 0.3 ± that the local minimizers are at x = 0.3 

and at x± = 0.3 ± y^ 25+vT^ q ^ 3 5 ]^^ Considering that / has a positive definite 

leading form, one of the local minimizers must be a global one, and comparing the 
function values shows that x = 0.3 is the continuous minimum. Moreover, / must have 
its integer minimizer in [—3,3] as min{/(x+),/(a:_)} > /(O); comparing the function 
values shows that / has a single integer minimizer at x = 0 with value /(O) ~ 2.84. The 

is given 


underestimator ^glob £ computed as optimal solution to GLOB 

ffGLOB « 8.71 • 10"^^ • {X - 0.3)® +1.09 • 10"°^ • {X - 0.3)^ + 0.75 • (X - 0.3)^ - 1.22 • 10■°^ 
is globally below /. To hnd an underestimator on a sublevel set, we first fix the level 


z = f{q) heuristically. Note that any g G Z is a feasible solution to IP and hence an 
upper bound; any integer minimum must be contained in JZ^{f{q)). As h = 0.3 is the 
global minimizer, we choose (7 = [/i] = 0 here. The polynomial g'sLS; given by 

gsLS ~ 9.09 • (X - 0.3)® + 11.80 • (X - 0.3)^ + 39.36 • (X - 0.3)^ - 0.81, 
is an underestimator on the sublevel set £<(/(0)) = [0,0.6], as can be seen in Figure 


lb It will be shown in the next section how this function can be found. The plot 
reveals the shortcomings of global underestimation; Any global underestimator in '^(0.3) 
cannot go above the local minimizers of /. This “barrier” from above turns 5 glob in 
this example essentially into a quadratic underestimator for small x as the ratio of the 
higher order coefficients and the one in front of the quadratic term is of order 10“^®. The 
underestimator ^SLS however is a degree 6 polynomial whose higher order coefficients 
are not small at all. Note that ^glob is much closer to / near 0.3 compared to the new 
underestimator ^SLS- However, the quality of the resulting lower bound depends on the 
function values at 0 and there g'sLS is closer to / than (7 glob- The lower bounds the 
two underestimators provide are ^globCO) ~ 0.07 and 5 sLs( 0 ) ~ 2.84. In this case, we 
are lucky as the lower bound on the integer minimum and /(O) coincide, showing once 
more that / has its integer minimizer at 0 . 


The SOS program for computing the improved underestimator 

How do we compute the improved underestimator? At hrst, we observe that every 
sublevel set z G M, of / is semi-algebraic. Indeed, with the notation from (3) and 

S := {z — /}, we have 

£{{z) = {x G 1 z - fix) > 0} = Kg. 


For this example we solved GLOB for h — 0.3 and degg = 6, using SOSTOOLS 3.00 and CSDP 6.1.0. 


16 















(a) From far 

Figure 1: Global underestimator (7 glob and an underestimator ggus on a sublevel set. 


Moreover, C^{z) is compact if the leading form of / is positive definite (see Propo- 
n. f 

sition 5). Compactness of in turn implies that the quadratic module Ms C 

R[Xi,Xn] generated by 5 := {f - g,z - /}, for any g € M[Xi, ..., Xn], is thus by 
definition Archimedean. Hence, for every feasible underestimator g G ^(h) the existence 
of a representation for f — g asin Putinar’s Positivstellensatz (Theorem]^ is guaranteed. 
This motivates the following program: 


= max Waba 

(SLS) 

a&J 


s-t. f -'^baiX - hf'^ - a{z - f) 

is SOS in M[Ai,.. ., X„], 

aeJ 


ba for a / 0, (T 

are sos in M[Xi,..., Xn], 

degcr < k 



The decision variables are the real ba as for GLOB and, additionally, the real coefficients 
of the polynomial cr. As before, we use the notation Wa ■= {[h] — 

SOS program once J and the degree of a are fixed. 


SLS is a valid 


Theorem 15. Let f G M[A], h G M” and g G 
> /(<?) for some q G Z". 


be a feasible solution to SLS with 
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1. Then gilh]) < inf^^zz^ f{x). 

2. If J is fixed, < y^^'> < < • • 0 

3- If fd is positive definite, there is /cq £ No such that SLS is feasible for all k > ko. 


4- |SLS with k = —oo is GLOB 


5. If f — f{h) G S and g is optimal, then g{[h~\) > f{h). 

Proof. Statement!^ holds as g feasible implies f—g — a{z — f) G S. Hence f{x)—g{x) > 0 
for those x with f{x) < z, especially for those x with f{x) < f{q) as f{q) < 2 ; by 
assumption. The claim follows by (16). 

Statement is clear as we only allow more coefficients for a. 

To see Statement note that Il{^{z) is nonempty as 2 : > f{q) and moreover compact 

(Theorem sL so f{x) > c for some c G M and all x G C{^{z). Hence f — c £ by 

Putinar’s Ksitivstellensatz (Theorem]^. This means f — c = ctq + a{z — f) for some 
SOS iTo,cT G M[X]. Thus g := c\s a, feasible solution, and /cq := deg a. 

To see Statement |4l we note that k = —00 corresponds to a = 0, in which case [SLS] is 

irrmBi 

Statement is a consequence of Statements and and Theorem [T^ □ 


We have not yet addressed the degree of g in [GLOB] and SLS nor the degree of a in jSLS 


The following proposition shows that once the degree of cr in |SLS is fixed, the degree of 
g in any feasible solution is bounded from above in terms of deg / and degcr. 

Proposition 16. Let f G M[X], g £'^{h) with deg / > 0, deg 5 > 0, 2 ; G M and cr G S 
such that 

f - g - a{z - f) is SOS. (17) 

Then 

deg{g) < deg(/) + max{deg((T), 0 }. 


Proof. Eq. is equivalent to f — g — a{z — f) = ctq for some (Tq G S, or 

g + cro = f{l + a) - za. 


(18) 


Hence deg( 5 ) < max {deg( 5 r), deg(cro)} = deg( 5 r + (Tq) ^ deg (/(I + a) - za) 

® max {deg (/(I + a)), deg( 2 ;f 7 )} deg (/(I + a)) 
deg(/) + deg(l + a) deg(/) + max{deg((T), 0}. 

As g — g{h) G S and deg 5 > 0, equality (I) follows from Lemma Equality in (H) 
follows from eq. (18). Using deg/ > 0, the equalities in (HI) and (IV) follow from 
a typical degree argument: It u,v G M[V], degrt / deg?;, we have n + f / 0 and 
deg(u + v) = max(deg?i, deg?;). Equality in (V) holds as the degree is multiplicative, 
(VI) follows easily if one distinguishes the cases cr = 0, cr G M>o and degcr >0. □ 


^Note that every sos polynomial cr 7^ 0 has even degree. 
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Corollary 17. Let g G ^{h) be a feasible solution for GLOB, Then degg < deg/. 

Proof. Use Proposition |16| with a = 0 and the result follows from Statement of Theo¬ 
rem [T^ 

□ 


6. Implementation and results on random instances 

6.1. Experimental setup 

To evaluate our results, we ran computer experiments: For a fixed number of variables n 
and an even degree d, we created instances of random polynomials 

/= ~ 1) i.i.d.. (19) 

|a|<cZ |Q:|<d 

As we are only interested in polynomials with positive definite leading form, we restricted 
ourselves to those polynomials that satisfy 


O(d,0,...,0) > 0) “{0,(i,0,...,0) >0) • • •) “{o,...,o,d) > 0 (A) 

since a polynomial with at least one of these coefficients nonpositive cannot be positive 
definite. Then, we solved program (12) with k = d + 2 io compute a lower bound 
Cd on min^ggn-i fd{x) to determine whether / indeed has a positive definite leading 
form. If Cd < 0, we discarded the instance, else we know that fd is positive definite. 
In the first part of the experiments, for every tuple (n, d) with n = 2,3,4 and d = 
2,4,6,8,10, we created 1000 random instances of polynomials that satisfy condition 
In Figure we plot how many of these have been detected to satisfy /d > 0. As 
d and n increase, the probability of positive definiteness should decrease - as, loosely 
speaking, more (independent) random variables Oq, simultaneously influence the result - 
which is reflected in the plot. We then use these instances to evaluate the norm bounds 
(see Section 6.2). In the second part of the experiments, for four tuples (n, d), we again 
generated polynomials according to (19) and took the first 50 of them that were detected 
to have a positive definite leading form as input for the optimization problem which is 
in turn solved by branch and bound (see Section 6.3). 

We use MATLAEQ 2014b 64-bit, SOSTOOLS 3.00 |PAV+13) to translate the SOS pro¬ 
grams into semidefinite programs and CSDP 6.1.0 [Bor99| to solve the latter. The experi¬ 
ments were conducted on GNU/Linux (Ubuntu 12.04) running on 2 Intel® Xeon®X5650 
CPUs (each 6 cores) with a total of 96 GB RAM. 


^MATLAB is a registered trademark of The MathWorks Inc., Natick, Massachusetts 
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Figure 2; Instances with detected positive definite leading form. 


6.2. Evaluating the norm bounds 

Once positive definiteness is certificated by some Cd > 0, the bounds on the norm of the 
minimizers can be computed. We summarize the steps to compute a norm bound on the 
minimizei0in algorithmic form (Algorithm [^ . 

For n and d as described above, and each of the 1000 randomly created polynomials 
that has been detected to have a positive definite leading form, we computed the bound 
from the literature from eq. Q and our new bound R (Theorem such that 
\\x '\\2 < R) holds for every continuous and integer minimizer of /. Figure]^ 

depicts a selection of d and n: Those with smallest degree, d = 2, and the maximal 
degree d < 10 such that we still detected some instances with fd > 0. We conclude that 
for the quadratic case d = 2, our approach does not yield significantly better results. 
However, for a higher number of variables and d > 4 we outperform the classic norm 
bound on all instances. Most prominently of this selection, for (n, d) = (3,8), we are 
better by a factor C = Ri^t/R of 50 throughout and in some instances we are better by 
a factor of (7 ~ 100. This means the number of feasible solutions decreases by a factor 
of up to C"" ~ 100^ in this example. 

6.3. Evaluating our underestimators within branch and bound 

We evaluated the underestimators in a branch and bound framework. Firstly, we present 
an algorithm that shows how special properties of our underestimators can be exploited 

®In the algorithm, we abbreviate integer minimizer(s) to i.m. and continuous minimizer(s) to c.m.. 
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Algorithm 1 Norm bound on minimizers 

input / G M[Ai,..., Xn] with deg / G 2N, parameters p G 2N, k^ax £ No 
A: ^ 0 


Cd < -oo 

X ^ NULL 

while k < /cmax and < 0 and x = NULL do 
solve program (12) for j = d and parameter k 
Cd <— optimal value 

if optimal solution can be extracted then 
X <r- optimal solution 

end if 
k k + 1 

end while 

if Crf < 0 and x ^ NULL then 
output X 

print fd{x) < 0 so / has neither i.m. nor c.m.. 
else if Crf < 0 then 

print Cannot decide fd>0 for k < k^^ax- 
else // Cd > 0 in the following 

print / has integer and continuous minimizers. // fd> 0 by & 

for j = 1,..., d — 1 do 

Cj max of ( 10 ) and // can be improved by also taking (2^ into account 

end for 

define (7 : M —>■ M, q{X) = 

R ■<— largest root of g in M 

output R 

print The minimizers x' suffice \\x'\\p < R- 
end if 


// Proposition 


// R>0 by Theorem^^ 
// Theorem^ 
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Figure 3: Bounds on the norm of minimizers for different dimensions n and degrees d. 


to speed up branching and pruning. In the actual experiments, we generated polynomials 
according to (19), where we restricted ourselves to the tuples (n, d) = (2,4), (2,6), (3,4) 
and (4, 2) to keep the problem size tractable and to have an acceptably high ratio of 
positive definite polynomials (compare Figure [^. We generated random polynomials 
until we had 50 that were detected to have a positive definite leading form and which 
were then used as input to the optimization problem. In the following we present an 
evluation of the initial lower bound and a runtime comparison with other lower 

bounds from the literature. 


6.3.1. Algorithm 

Our branch and bound framework is depth first. This keeps memory usage small 
and allows us to quickly obtain good feasible solutions. We do not reorder the vari¬ 
ables. Subproblems are collected in a list £; every subproblem "P G £ is of the form 
V = {m,ri,... ,rm), where m G {0, ...,n} encodes the number of fixed variables 
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(ri,... ,rm) G i.e., 


min f{ri,...,rm,Xm+i,---,Xn) 
Xm+li ■ ■ ■ 1 Xn G Z 


{'P = {rn,ri,... ,rm)) 


and (0) encodes the initial problem. Algorithm 2 states the whole procedure. 


Algorithm 2 Branch and Bound 


4: 


input / G R[Xi,... ,Xn], h G 
X* <— [h] 

u ^ f{x*) 

C ^ {(0)} 


p-norm bound R on minimizers, k G 2No 

// initial guess for integer minimizer 
// upper bound on integer minimum 
// initial list of subproblems 


do 


find underestimator g: solve SLS with h, degg < deg cr = k 

while £ 7 ^ 


//or GLOB , resp 


8 : 


pick V = {m, ri, 

C^C\{V} 


G C with m maximal 


12 : 


16: 


if 


m < n 
L ^ 


then 

f/WP 


r/P 


m 


\P 


let 5 : M -)■ M, 5(Xm+l) = 5(^1, ...,rm, Xm+l, [hm+2\ , • • • , [hn\ ) 
if g{[hm+i~\) < u then // otherwise prune 

hnd Li G [—L, L] 0 Z minimal with g{Li) < u 
if such an Li exists then 

find L2 G [—L, L] n Z maximal with ^(£2) < u 
else 


L\ i — -j-oo, L2 ^— —00. 

end if 

for all rm+i G [Li, L 2 ] O Z do // [Li,L 2 ] = 0 z/Li = +00 

20: ££ U {(m + 1, ri,..., Tm+i)} // actual branching 

end for 
end if 

else // all variables Xi were fixed to values rj 

24: if /(r) < u then // update upper bound 

X* •(— r 
u ^ f{r) 

end if 

28: end if 

end while 
output X*, u 

print / attains its integer minimum u at x*. 


Proposition 18. 1. Algorithm^is correct, that is, it always terminates after a finite 

number of steps with an optimal integer solution x* that satisfies f{x*) = u. 
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2. The integers Li and L 2 (in lines 13 & 15) can be found with binary search in 
|'log2(L)] + 2 < |'log2(-R)] +2 evaluations of g if L > 0. 

Proof. Let x* be any optimal solution. To prove it suffices to show that the algorithm 
terminates and no problem with {n,x*) as subproblem gets pruned in Stepor lost in 
Step[^ To see termination of the algorithm, we observe that the number of subproblems 
is finite as the sets Bm = {y G \ ||y||p < R}, m = 1,... ,n are finite, every subproblem 
(m, ri,..., Xm) suffices (ri,..., Xm) G Bm and no subproblem is inserted into the list C 


more than once. To see that x* does not get discarded in Step 12 define 


g{xm+i) ■■= g{xl Xm+1 ) [hrrt+2l j • • ■ j [hn \ ) (^0) 

and suppose g{[hm+i~\) > u. Hence 

g{[hm+i\) >u> f{x*) > g{x*) > g{xl,.. .,x(^, [hm+i ],..., L^inl) = g{lhm+i]), 
a contradiction, where we used the monotonicity property of g (Proposition [ll| ) and 

f f I—I 

that g{x) < f{x) for x G C'!^{f{q)), a fortiori for x G £<(/(x*)). Suppose that x* gets 
lost in Step Necessarily, x)^^^ < Li or x*^j^i > L 2 . We derive a contradiction for 
< -^1) fhe other case is identical. Observe that x(^_^_l G [—L,L] as every optimal 

integer solution satishes must have = l(J\x*^j^f\P < 

{/W — \xWp — ... — \x)^\P. As x^_^_i is integer, we may round down - in other words, 
L,L]. By dehnition of Li and Proposition 11, we have g{x(^_^_l) > u with g 




from (20), thus, using Proposition 11 again 


9{x*m+i) >u> f{x*) > g{x*) > g{x*^+i), 


a contradiction. 

We finally show that Claim holds. We prove the claim for h^+i > 0, the proof for 
hk+i < 0 is similar. In case h^+i > L, Li exists if and only if g{L) < u as g{xk+i) 
is non-increasing for x^+i < h^+i (by Proposition 11); necessarily, L2 ■= L. Using 
binary search on [—L,L], Li can be found using at most [log2(2L)] = [log2(L)] -|- 1 


further evaluations of g. In case 0 < hk+i < L, Li exists as g{lhk+i~\) < u in Step 12 
Again using binary search, Li G [—L, [hfc+i]] can be found in no more than [log2(2L)] 
evaluations. As g{xk+i) = g{hk+i — Xk+\), it only needs at most one more evaluation 
of g to find L 2 , so we find both numbers in no more than [log2(L)] -|- 2 evaluations of 
~g. □ 


Remark 19. Concerning our implemenation, we chose degg = deg/ for [GLOB] and 
SLS and deg a = 2 for SLS, For the parameter h G M"" we chose an (approximate) 
continuous minimizer computed via the SOSTOOLS function findbound.m - however, 
the algorithm accepts arbitrary h G M”. We determined R using Algorithm 
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6.3.2. The initial lower bound on the minimum 

Before we compare our underestimators with lower bounds from the literature, we di¬ 
rectly evaluate our initial lower bound 5( [/i]) • To this end, we define a ratio Q as follows: 
Let /i be a continuous minimizer of / (if found by sos methods), x* an integer minimizer 
of / found during B&B and g he a solution to [GLOB] or |SLSl Then 

Q 9{[h]) - f{h) 

takes values in [0,1], is invariant under scaling of / by constants A > 0 and addition of 
constants c G M to / - and, needless to say, the larger Q, the tighter the lower bound. 
See Figure for the results. 
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Figure 4; Lower bound comparison using the ratio Q. 


By Theorem |SLS| gives bounds that are at least as good as |GLOB[ The plots show 
that SLS often gives strictly tighter bounds. 


6.3.3. Presentation of other bounds 

It is not straightforward to compare the performance of our lower bounds with bounds 
from the literature. In our setting, we compute a single underestimator per instance - 
which is then merely evaluated during the branch and bound process]^ We could not 
find other underestimators with this property that give sensible results in branch and 
bound. However, there are lower bounds in the literature that are more general than 
ours since they consider restricted polynomial optimization problems and can hence be 
applied to any polynomial - not only to those with positive definite leading form - 
and are suitable for branch and bound if computed anew at each node. In addition to 


Algorithm 2 (with GLOB and SLS) we implemented the following four algorithms in a 


®By fixing some variables at each node and then computing new underestimators, this could be improved 
but would need additional runtime for the computation of the new underestimator. 
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MATLAB framework for solving [TP} three of them are branch and bound approaches as 
Algorithm 2 which use other bounds (taken from [BD14] , |LHKWn6] , and the continuous 
relaxation) while our last algorithm is a simple brute force approach. 


For arbitrary polynomials on boxes, Buchheim and D’Ambrosio [BD14] suggested 
to compute, for every term of /, the L^-best separable underestimator. The sum 
of the underestimators is again separable, so its integer minimization is a uni¬ 
variate problem. For degree d < 4 and arbitrary n, they provide explicit un¬ 
derestimators. We hardcoded the explicit underestimators, and used the MAT- 
LAB builtins polyval, polyder and roots to evaluate and differentiate the sep¬ 
arable underestimators, and to compute their roots, respectively. As a suitable 
box at the subproblem V = (m,ri,... ,rm) we chose the box [—L, L]”"™ where 


L = 


^RP-\ri\P - 


The authors suggest to successively halve the 


box into subboxes which does not fit into our scheme. This approach is abbreviated 
SEP in the plots. 


• For nonnegative polynomials on polytopes P, De Loera et al. |LHKW06] approxi¬ 
mate the maximum of / on PnZ” by the sequence Each mem¬ 

ber of the sequence can be computed in polynomial time, using a reformulation as 
a limit of a rational function which in turn is based on the generating function of 
P. We did experiments with k = 2 and k = 4, the latter taking significantly longer, 
without giving much better results, so we restricted ourselves to fe = 2. Note that 
the suggested implementation uses residue techniques, while we just use symbolic 
limit computations. On the other hand, we improved the bounds as follows: To 
make their approach applicable to not necessarily nonnegative polynomials, the 
authors suggest to add the sufficiently large constant 

ll/lloll/llooM-* 


to obtain / = / + c nonnegative on P. Here, M > 0 is a bound on the poly¬ 
hedron s.t. \xi\ < M for all x G P; for / = we use the zero “norm” 

ll/llo := #{cK I fla 7 ^ 0} and the infinity norm ||/||oo := maXaHaal}. However, 
the constant c' := Yl'j=o suffices to ensure that f + c' is nonnegative on P. 

A short calculation shows that c' < c if M > 1, and in dense instances one often 
has c' <C c. As polyhedron we again chose the box [—L, L]”"”* from the previous 
bound. This bound is abbreviated to POLYFIX in the plots. 


• We compute an sos approximation of the global continuous relaxation (CR in the 
plots) at each subproblem P = (m, ri,..., r^), that is 


max A 

s.t. /(n, . . . ,rm,Xm+l, ■ ■ ■,Xn) - A is SOS in ..., Xn] 


• Brute force enumeration with no lower bounds, abbreviated BE. As / has to be 
evaluated at each node, we use matlabFunction to convert the Symbolic Math 
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Toolbox object that encodes / into a function handle that can be evaluated signif¬ 
icantly faster. 


Algorithm 2 using GLOB with parameters as described in Remark 19 


Algorithm 2 using SLS with parameters as described in Remark 19 


6.3.4. Runtime comparison 


The implementation of the six different algorithms from Section 6.3.3 into our B&B- 
framework gave the runtimes in Figure]^ (logarithmic scale). On every instance each of 
the lower bounds had a maximum of 5 minutes to complete; if this time constraint was 
violated, the process was interrupted and the lower bound considered as unsuccessful 
on this instance. If the parameter h could not be found by SOSTOOLS’ findbound.m 
function, 


GLOB and SLS were considered to have violated the time constraint. 






Figure 5: Runtimes in [s]. 
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We infer from the plots that for a small number of variables, the problem size (i.e., R), is 
mostly so small that brute force is often the fastest approach. However, if instances get 
larger, brute force fails necessarily as the processing time is linear in the number of nodes. 
SEP is quite fast in small instances, but for large instances the running time deteriorates 
as an underestimator is computed at each node. In our setting, POLYFIX takes too long 
to be competitive. The continuous relaxation is satisfactory for smaller instances but 
fails in some large instances. Concerning our bounds, in the two plots of Figure with 
n = 2, there is a surprisingly little variance in runtime for GLOB and SLS. This can be 
explained from a further plot, see Figure in which we break down the preprocessing 
time, i.e., the time needed to compute a approximate continuous minimizer h and the 
underestimator g, and the time needed for the actual branch and bound. It can be seen 
that the preprocessing time is more or less independent from the instance and takes in 
most instances signihcantly longer than the actual branch and bound. Also, it seems 
at first that SLS takes mostly longer than GLOB However, this holds only true for the 
preprocessing phase: The corresponding sos program is larger, and so are preprocessing 
times. Indeed, Figure [^reveals that GLOB has shorter preprocessing times throughout, 
but is inferior in B&B, as expected. 
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7. Conclusion and Outlook 

In this paper we presented a new way of finding underestimators for integer polynomial 
optimization and improved the bounds on the norm of integer and continuous minimiz- 
ers. We implemented both ideas within a branch &: bound approach showing how they 
improve its performance. 

Gurrently we compute one underestimator at the beginning of the branch and bound 
process which is used for generating lower bounds throughout the whole algorithm. In¬ 
stead, one could also compute a new underestimator at each node of the branch and 
bound tree. This would improve the bounds but due to the comparatively large compu- 
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tation time for solving an sos-program does not pay off in terms of overall efficiency. We 
currently analyze in which nodes the computation of a new underestimators improves the 
procedure. Along the lines of |BHS14| we plan to analyze how to find an underestimator 
which is likely to be a good one for all subnodes. 

We also point out that our procedure can be extended to mixed-integer polynomial 
optimization: The norm bounds apply in the mixed-integer case as well, and we may use 
the proposed class of underestimators, but with their mixed-integer minima (which are 
also simple to obtain). Our underestimators can in principle also be used for constrained 
polynomial optimization; however, it is subject to further investigation if the bounds 
provided are sharp enough for this case. We hence work on sos-programs which provide 
underestimators which are able to take into account given constraints. 


A. Computing the norm bounds 


In Remark we saw that we get a tighter norm bound R on the minimizers the closer 
the Cj get to their optimal value c*. In the following, we present two means that improve 
on the approach!^ in Section 4.1 that do not rely on sos programming. The second 
method we present is a refinement of the first. For both, we improve the norm bound 
R by replacing the estimate |x"| < 1 on with |x“| < where x is a continuous 


maximizer of the function S” ^ 


sr' 

X I—> 


A.l. A direct improvement 

One has the following closed form for the continuous minimizer x with nonnegative 
coordinates: 


Lemma 20. Let 0 7 ^ a G Nq and p G [l,oo). Then, the monomial A“ attains its 
maximum on at x with coordinates 


X 


i 



n. 


( 21 ) 


Proof. By a simple analysis, the proof can be reduced to a* > 1 for i = 1,..., n and 
then to maximization of A“ on {x G | xi > 0,..., x„ > 0}. Using the method of 
Lagrange multipliers, the claim follows from a short calculation. □ 


Observation 21. Denote by X(q,) the maximizer of on S” ^ 
X G we have 


as in (21). Hence for 


fji^) = ^ -|Qo 

\a\=j \a\=j 


■ (^(«))“ =: Cj. 


( 22 ) 


This Cj is as least as large as approac/i[^ from Section 4-1 since, for 0 / a, (^(a))“ < 1 
- unless A" G M[Ai] for some i, in which case X(q,) = Cj, the i-th unit vector, and thus 
(X(„))“ = 1. 
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A.2. A different approach 

This last approach on computing bounds cj is different to the ones before, as we actually 
compute 2^ norm bounds: We restrict / to each of the 2"' orthants 


-ffr = {a; £ I TiXi > 0} for r G {—1,1}" 


and compute norm bound on integer minimizers of every f\Hr- This has the advantage 
that, roughly speaking, we may neglect half of the terms of / = '^aaX°‘. Also, mini¬ 
mization on H-r can be reduced to minimization on i.e., the set of those x G M"" 

with X > 0, as we shall see in a moment. 

Introducing the notation |o|“ = | min(a, 0)| for a G M and with x from (21), we have for 
every term Oax" > — |ao 




Oax" > — IuqI x°‘, X G Sp ^ and x > 0, 

H=j H=j 

" -^ 


which means about half of the coefficients are neglected in comparison to (22), if signs 
are distributed equally among the Oa- Now let be the largest real root of 


i=i 

The verbatim argument of Theorem shows that /(x) > /(O) for ||x||p > ijT’ -'d) and 
X > 0. This bounds integer and continuous minimizers on Bounding the norm of 

minimizers of / on Hr, r G { — 1,1}”, can be reduced to bounding the norm of minimizers 
on by a simple change of coordinates. To this end, let r(x) = (nxi,... ,TnXn), 

X G M”, and /” be the polynomial 


fix) := /(r(x)) = ^ aar"x“, r G {-1,1}”. 

OL 


As r“ G {—1,1}, / and /” merely differ in the sign of their coefficients, and /^(x) > Cd 
still holds for x G as the sphere is r-invariant, that is r(Sp“^) = Similarly to 

before, denote by the largest real root of 


d-l 

q^X)=CdX‘^ + Y,c]X^ 

i=i 

with cj = — |aar“|“x". It is now clear that /’’(x) > /(O) for ||x||p > and x > 0, 
equivalently, /(x) > /(O) for ||x||p > i?” and x G Hr- 

This results in more effort in the preprocessing, but reduces the number of feasible 
solutions. 
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